A Modular Visual Tracking System
نویسنده
چکیده
I am currently building an active visual tracking system for a real world robot. The hardware is being built at MIT under the supervision of Professors Rod Brooks and Lynn Andrea Stein, and is humanoid in form. The software is also humanoid: I am basing its organization on models of early vision in the human brain. Most of the software is still in the design phase; what I describe here is the part of the system that is already up and running. The robot, named Cog, has roughly the same degrees of freedom in the waist, neck, arms and eyes as a human and is designed with similar proportions in mind. The eyes sport a simulated fovea each eye consists of two cameras mounted in the same plate, one with a wide field of view, and one with a much narrower view. Both cameras produce 128 x 128 gray scale images. The narrow one is used for tracking, and can be used for object recognition, while the wider view will be used for motion detection and peripheral vision. I have written a visual tracking system that will eventually be hooked up to the motors in Cog’s neck and eyes. For now, tracking is simulated by moving a small 16 x 16 “attention window” around the larger image from one of the cameras. On startup, the system memorizes the 16 x 16 segment in the center of the full image as a “reference” image. For the rest of the run, the system moves the window around to maintain whatever was initially present within its view. The tracking system runs as follows. Once a new frame is grabbed, the system makes a guess about where the new window location should be, based on its previous velocity. Next, a portion of the image slightly larger than the attention window is selected, and the derivative of this region is computed.’ Finally, a simple correlation is performed between the memorized reference image and the nine 16 x 16 windows centered around the pixel position nearest the guess. The one
منابع مشابه
Modular Neuromorphic VLSI Architectures for Visual Motion and Target Tracking
Modern age intelligent systems will require extensive computational power, complex parallel processing units, and low-power design. Biologically inspired neuromorphic VLSI systems present a viable solution to the demands of both highly parallel, and low-power consuming processors. Among biological sensory systems, vision is the most important one with the largest portion of the brain devoted to...
متن کاملITrackU: An Integrated Framework for Image-based Tracking and Understanding
Within the ITrackU project, a modular software architecture for model-based visual tracking and image understanding is being developed. The library is general-purpose with respect to object models, state-space parameters, visual modalities employed, number of cameras and targets, and tracking methodology. This provides the necessary building blocks for a seamless integration of a wide variety o...
متن کاملVISP: A Software Environment for Eye-in-Hand Visual Servoing
In this paper, we describe a modular software that allows fast development of eye-in-hand image-based visual servoing applications (ViSP states for “Visual Servoing Platform”). Visual servoing consists in specifying a task as the regulation in the image of a set of visual features. Various issues have thus to considered in the design of such application: among these issues we find the control o...
متن کاملVisual Tracking using Kernel Projected Measurement and Log-Polar Transformation
Visual Servoing is generally contained of control and feature tracking. Study of previous methods shows that no attempt has been made to optimize these two parts together. In kernel based visual servoing method, the main objective is to combine and optimize these two parts together and to make an entire control loop. This main target is accomplished by using Lyapanov theory. A Lyapanov candidat...
متن کاملVisp: a Software Environment for Eye-in-hand Visual Servoing 2 Image Based Visual Servoing
In this paper, we describe a modular software that allows fast development of eye-in-hand image-based visual servoing applications (ViSP states for \Visual Servoing Platform"). Visual servoing consists in specifying a task as the regulation in the image of a set of visual features. Various issues have thus to considered in the design of such application: among these issues we nd the control of ...
متن کاملVisual Tracking using Learning Histogram of Oriented Gradients by SVM on Mobile Robot
The intelligence of a mobile robot is highly dependent on its vision. The main objective of an intelligent mobile robot is in its ability to the online image processing, object detection, and especially visual tracking which is a complex task in stochastic environments. Tracking algorithms suffer from sequence challenges such as illumination variation, occlusion, and background clutter, so an a...
متن کامل